The article discusses using discrete language diffusion models for text generation, showing how BERT's masked language modeling can be generalized into a diffusion framework. It traces the evolution from established models like BERT and GPT to the newer Gemini Diffusion model, and introduces the idea of turning BERT's training objective into a generative process by varying the masking rate. The author also notes related work, such as DiffusionBERT, which develops the same idea with more rigorous experiments.
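To make the variable-masking idea concrete, here is a minimal sketch of a diffusion-style MLM training step and the matching iterative-unmasking generation loop. It assumes a HuggingFace-style PyTorch masked language model whose forward pass returns `.logits`; all names (`diffusion_mlm_step`, `diffusion_generate`, `mask_token_id`, `pad_token_id`) are hypothetical illustrations, not from the article.

```python
import torch
import torch.nn.functional as F

def diffusion_mlm_step(model, input_ids, mask_token_id, pad_token_id):
    """One denoising step: instead of BERT's fixed ~15% masking, sample a
    masking rate t ~ U(0, 1] per sequence and predict the masked originals."""
    batch, seq_len = input_ids.shape
    # The sampled rate t plays the role of the diffusion noise level.
    t = torch.rand(batch, 1, device=input_ids.device).clamp(min=1e-3)
    # Mask each non-padding token independently with probability t.
    is_real = input_ids != pad_token_id
    mask = (torch.rand(batch, seq_len, device=input_ids.device) < t) & is_real
    corrupted = input_ids.masked_fill(mask, mask_token_id)
    # Standard MLM cross-entropy, computed only on masked positions
    # (-100 is the default ignore_index).
    logits = model(corrupted).logits
    labels = input_ids.masked_fill(~mask, -100)
    return F.cross_entropy(logits.reshape(-1, logits.size(-1)), labels.reshape(-1))

@torch.no_grad()
def diffusion_generate(model, seq_len, mask_token_id, steps=10):
    """Generate by starting fully masked and unmasking the most confident
    predictions a fraction at a time, at decreasing noise levels."""
    ids = torch.full((1, seq_len), mask_token_id, dtype=torch.long)
    for step in range(steps):
        still_masked = ids == mask_token_id
        if not still_masked.any():
            break
        logits = model(ids).logits
        probs, preds = logits.softmax(-1).max(-1)
        # Unmask roughly an equal share of the remaining masks each step.
        k = max(1, int(still_masked.sum().item() / (steps - step)))
        conf = probs.masked_fill(~still_masked, -1.0)
        top = conf.topk(k, dim=-1).indices
        ids[0, top[0]] = preds[0, top[0]]
    return ids
```

Training over all masking rates is what turns the BERT objective generative: a model that can denoise at any noise level can start from an all-mask sequence (t = 1) and walk it down to clean text, which is the process the article attributes to models like Gemini Diffusion and DiffusionBERT.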